79 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Afrikaans Albanian Amharic Arabic Aragonese Armenian Assamese Azerbaijani Basque Belarusian Bengali Bosnian Breton Bulgarian Burmese Catalan Central Khmer Chinese Croatian Czech Danish Dutch Dzongkha English Esperanto Estonian Finnish French Gaelic Galician Georgian German Greek Gujarati Hausa Hebrew Hindi Hungarian Icelandic Igbo Indonesian Irish Italian Japanese Kannada Kazakh Kinyarwanda Korean Kurdish Kyrgyz Latvian Limburgan Lithuanian Macedonian Malagasy Malay Malayalam Maltese Marathi Mongolian Nepali Northern Sami Norwegian Norwegian Bokmål Norwegian Nynorsk Occitan Oriya Panjabi Pashto Persian Polish Portuguese Romanian Russian Serbian Serbo-Croatian Sinhala Slovak Slovenian Spanish Swedish Tajik Tamil Tatar Telugu Thai Turkish Turkmen Uighur Ukrainian Urdu Uzbek Vietnamese Walloon Welsh Western Frisian Xhosa Yiddish Yoruba Zulu
Availability:
Freely Available
License:
Size:
55 million sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
- Paper track: Long/Machine Translation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Biao Zhang | the open parallel corpus (OPUS) | /N |
Documentation:
None
Not Applicable
Contextualised word embeddings,
Language Type:
Monolingual
Languages:
Ancient Arabic Basque Bokmål Bulgarian Catalan Chinese Church Croatian Czech Danish Dutch English Estonian Finnish French Galician German Greek Hebrew Hindi Hungarian Indonesian Irish Italian Japanese Korean Latin Latvian Norwegian Nynorsk Old Persian Polish Portuguese Romanian Russian Simplified Chinese Slavonic Slovak Slovene Spanish Swedish Turkish Ukrainian Urdu Uyghur Vietnamese
Availability:
Freely Available
License:
none
Size:
18.4 GByte
Production Status:
Existing-used
Use:
Parsing and Tagging
- Paper title: Treebank Embedding Vectors for Out-of-domain Dependency Parsing
- Paper track: Short/Syntax: Tagging, Chunking and Parsing
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joachim Wagner | Elmo For Many Languages | /N |
Documentation:
https://www.aclweb.org/anthology/K18-2005/
Speech
Corpus,
Language Type:
Monolingual
Languages:
Danish Dutch English French German Norwegian Swedish
Availability:
From Owner
License:
The copyright holders for the individual languages are: Danish: Tele Danmark, Jydsk Telefon, Denmark; Dutch: Royal PTT Nederland NV (KPN), TNO Human Factors Research Institute, Soesterberg, The Netherlands; English: University College London, United Kingdom; French: CNRS / INPG (ICP), France; German: Universität Bielefeld, Germany; Norwegian: The Norwegian Institute of Technology, SINTEF DELAB and Telenor Research, Norway; Swedish: Dept of Speech Communication and Music Acoustics, KTH, Sweden
Size:
5 CD-ROMs per language (Other)
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: Harmonic beamformers for non-intrusive speech intelligibility prediction
- Paper track: 6.6 Speech intelligibility/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Charlotte Sørensen | EUROM_1 | /N |
Documentation:
Publicly available documentation in English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Catalan Chinese Dutch Estonian French German Indonesian Italian Japanese Latvian Mongolian Persian Portuguese Russian Slovenian Spanish Swedish Tamil Turkish Welsh
Availability:
Freely Available
License:
CC0
Size:
2880 hours
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: CoVoST 2 and Massively Multilingual Speech Translation
- Paper track: 12.1 Spoken machine translation/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Juan Pino | CoVoST 2 | /N |
Documentation:
None
Speech
Speech corpus,
Language Type:
Monolingual
Languages:
Swedish
Availability:
From Owner
License:
Size:
2089 sentences
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: "You don't understand me!": Comparing ASR results for L1 and L2 speakers of Swedish
- Paper track: 10.2 Applications in education and learning (incl./Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ronald Cumbal | Ville | /N |
Documentation:
None
Speech
Speech corpus,
Language Type:
Monolingual
Languages:
Swedish
Availability:
From Owner
License:
Size:
1610 sentences
Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
- Paper title: "You don't understand me!": Comparing ASR results for L1 and L2 speakers of Swedish
- Paper track: 10.2 Applications in education and learning (incl./Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ronald Cumbal | CORALL | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Danish Norwegian Swedish
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Transparent pronunciation scoring using articulatorily weighted phoneme edit distance
- Paper track: 10.2 Applications in education and learning (incl./Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Reima Karhila | Spraakbanken | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
Swedish
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
15619 entries
Production Status:
Existing-used
Use:
Language Learning/Grading
- Paper title: Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
- Paper track: Evaluation/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Johannes Graën | SVALex | /N |
Documentation:
see publications
Written
Lexicon,
Language Type:
Multilingual
Languages:
English French Swedish
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
41425 entries
Production Status:
Newly created-finished
Use:
Language Learning/Grading
- Paper title: Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
- Paper track: Evaluation/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Johannes Graën | multiCEFRLex | /N |
Documentation:
see this publication
Written
Lexicon,
Language Type:
Monolingual
Languages:
Swedish
Availability:
Freely Available
License:
CC-BY-SA 3.0, LGPL 3.0
Size:
8425 entries
Production Status:
Existing-used
Use:
Language Learning/Grading
- Paper title: Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
- Paper track: Evaluation/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Johannes Graën | Swedish Kelly list | /N |
Documentation:
see website




